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Abstract 

I study the recurrence D{n) = D(D(n — 1)) + D(n — 1 — D{n — 2)), 
-D(l) = D(2) = 1. Its definition has some similarity to that of Con- 
way's sequence defined through a(n) = a{a(n — 1)) + a(n — a(n — 1)), 
a(l) = a(2) = 1. However, in contradistinction to the completely 
regular and predictable behaviour of a(n), the D-numbers exhibit 
chaotic patterns. In its statistical properties, the -D-sequence shows 
striking similarities with Hofstadter's Q(n)-sequence, defined through 
Q(n) = Q(n - Q(n - 1)) + Q(n - Q(n - 2)), Q(l) = Q(2) = 1. 
Compared to the Hofstadter sequence, the Z)-recurrence shows higher 
structural order. It is organized in well-defined "generations", sepa- 
rated by smooth and predictable regions. The article is complemented 
by a study of two further recurrence relations with definitions similar 
to those of the Q-numbers. There is some evidence that the different 
sequences studied share a universality class. Could it be that there 
are some real life processes modelled by these recurrences? 



I OFFER A CASH PRIZE OF $100 TO THE FIRST PROVIDING A 
PROOF OF SOME CONJECTURES ABOUT D(n) FORMULATED 
IN THIS ARTICLE. 
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Figure 1: Graph of Q(n). 

1 Introduction 

The recursion relation 

Q{n) = Q(n - Q(n - 1)) + Q(n - Q(n - 2)) for n > 2 , 

(1) 

Q(1) = Q(2) = 1, 

introduced by D. R. Hofstadter in his book Godel, Escher, Bach: an 
Eternal Golden Braid 0, is a challenge [@]. Its apparently chaotic 
behaviour (see figure |I]) is far from being understood. There appear to be no 
rigorous results about the behaviour of Q(n). 

In ref. I reported a number of mainly statistical observations on the 
Q-numbers. The main conclusions were: 

• The sequence shows some signals of order. It is organized in "gen- 
erations", making up for a Fibonacci-type structure on a logarithmic 
scale. 

• The variance of fluctuations around n/2 goes like n a , with a = 0.88(1). 
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• R(n) = (Q(n) — n/2)/n a has a well-defined, strongly non-Gaussian 
probability density p*. 

• There is scaling: x m = R(n) — R(n — m) is distributed according to 
^mP*(xrn/ ' Kn) ■ The rescaling factor X m converges to a/2 for large m, 
exponentially fast with a decay length £ = 3. 

It is an interesting question whether similar observations can be made on 
other integer recurrences. In this paper, I introduce and study the recurrence 



Its definition is not too different from that of Conway's sequence a(n), defined 
through 



The a(n)-sequence has been investigated by Hofstadter, Conway and others 
at various times since about 1975 @, ||. Conway discovered many of its 
properties. A cash prize that he offered for information about its asymptotic 
growth was won by Mallows f|. See also for a detailed study of a{n). 

Conway's sequence has a lot of fascinating properties. However, it behaves 
in a regular and completely predictable way. In contrast, D{n) develops 
chaotic and irregular patterns, separated by smooth and predictable regions. 
The latter property underlines its close relation to the a{n) function. 

In section 2, I report on some non-statistical observations of properties of 
the .D-sequence. Section 3 is about some of its statistical properties. The sec- 
tion describes investigations of the step size distribution, scaling properties, 
and frequency counting. Section 4 complements the study of the D-sequence 
by an investigation of two further chaotic recurrences that might be called 
chaotic cousins of the Hofstadter sequence. All sequences studied share var- 
ious statistical properties. This suggests that they belong to a common 
universality class. Because of its clear structure the D-sequence seems to be 
a natural candidate for studies of this class. 



D{n) = D(D(n - 1)) + D(n - 1 - D{n - 2)) for n > 2 , 
D(1) = D(2) = 1. 



(2) 



a(n) = a(a(n — 1)) + a(n — a(n — 1)) for n > 2 , 
a(l) = a (2) = 1. 



(3) 
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2 Non-Statistical Observations 



Figure |2| shows the first 2048 members of the a- and .D-sequences. Both a 
and D are organized in "generations" of increasing length and stay in some 
neighbourhood of n/2. These facts become even more obvious when looking 
at 2 a(n) — n and 2 D(n) — n, see figure |[ 

In order to make the "genealogy" more precise, we define a generation 
number g(n) for each n > 1 through 

, x f if n = 1 , , 
9[n) ~\k if 2 k ~ l < n < 2 k for n > 1 . [ } 

As in H|, we interpret _D(n) as the sum of its mother at spot D{rii) and its 
father at D(n 2 ), with 

m = .D(n-l), 

n 2 = n -l- D{n - 2) . 1 J 

Tables |T] and |2] show the structure of the generations and the genealogy 
Inspecting an extended version of table [I], we make a number of observations, 
valid for generation k, with k > 5: 

CI For the first k — 2 members the function D takes the value 2 k ~ 2 . The 
(k — l)th element is 2 k ~ 2 + 1. (The first k — 1 members of a generation 
will be called head.) 

C2 For the last /c — 2 members the function D takes the value 2 fc_1 . The 
element just before the last k — 2 members takes the value 2 k ~ l — 1. 
(The last k — 1 members will be called tail.) 

C3 The last member of generation A; — 2 is simultaneously the mother of 
all head members and the father of the first head member. The fathers 
of the remaining head members are (in ascending order) the members 
of the head of generation k — 1. 

C4 The parents of tail members are tail members of generation k — 1. 

C5 The values of D(n) lie in the range [2 fe ~ 2 , 2 k ~ 1 }. 
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Figure 2: Graphs of a(n) and D(n). 




Figure 3: Graphs of 2a(n) — n and 2D(n) — n. 
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Table 1: Genealogy in the D-sequence. Head, body and tail of a 
generation are separated by horizontal lines. Note that tails are not 
defined for k < 5. 



6 



k n ri\ g{n\) n,2 5(^2) D(n) 



6 33 


16 


4 


16 


4 


16 


34 


16 


4 


17 


5 


16 


35 


16 


4 


18 


5 


16 


36 


16 


4 


19 


5 


16 


37 


16 


4 


20 


5 


17 


38 


17 


5 


21 


5 


18 


39 


18 


5 


21 


5 


18 


40 


18 


5 


21 


5 


18 


41 


18 


5 


22 


5 


18 


42 


18 


5 


23 


5 


18 


43 


18 


5 


24 


5 


19 


44 


19 


5 


25 


5 


21 


45 


21 


5 


25 


5 


23 


46 


23 


5 


24 


5 


21 


47 


21 


5 


23 


5 


20 


48 


20 


5 


26 


5 


24 


49 


24 


5 


28 


5 


25 


50 


25 


5 


25 


5 


26 


51 


26 


5 


25 


5 


28 


52 


28 


5 


25 


5 


27 


53 


27 


5 


24 


5 


26 


54 


26 


5 


26 


5 


30 


55 


30 


5 


28 


5 


30 


56 


30 


5 


25 


5 


29 


57 


29 


5 


26 


5 


30 


58 


30 


5 


28 


5 


30 


59 


30 


5 


28 


5 


30 


60 


30 


5 


29 


5 


31 


61 


31 


5 


30 


5 


32 


62 


32 


5 


30 


5 


32 


63 


32 


5 


30 


5 


32 


64 


32 


5 


31 


5 


32 



Table 2: Continuation of table |I[ 
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Figure 4: Graph of 2D(n) — n, together with ±(2a(n) — n). 







Figure 5: Graphs of 2aD k (n) - n, for k = 7, 8, 9, 10. 
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An interesting observation can be made when one plots together D(n) 
and a(n). Figure (| shows 2D(n) — n, together with ±(2a(n) — n). The 
latter two functions nicely model the "outer" boundary of the fluctuating 
D(n) in some neighbourhood of the generation boundaries. 

The close relation of a and D is also underlined by the following exper- 
iment: Use the a- recurrence to generate the first k generations of numbers. 
Then continue with the recursion relation of the D-numbers. Graphs il- 
lustrating the behaviour of the resulting function, to be called aDk(n), are 
shown in figure |5|, for k from 7 to 10. Plotted are 2aDk(n) — n, together 
with 2 a{n) — n. With increasing k the "chaotic" fluctuations get reduced 
and the function becomes very similar to a(n). It seems that one can in this 
way generate a large family of sequences with different "levels of chaos" . 

3 Statistical Properties 

In generation k, i.e. for 2 fc_1 < n < 2 k , the function D(n) takes values in the 
range 2 k ~ 2 < n < 2 k ~ 1 . It seems natural to plot y = D(n)/2 k ~ 1 in terms 
of x = (n - 2 k - 1 )/2 k - 1 . We have < x < 1, and y < 0.5 < 1. Plots of 
this type for generations 6 to 13 are shown in figure The similarity of 
the graphs suggests that there could be some statistical properties becoming 
independent of k when k becomes large. 

3.1 Step Size Statistics 

The function inside a given generation may be considered representing a 
random walk of 2 k ~ 1 — 1 steps, starting from 2 k ~ 2 and arriving at 2 k ~ 1 . It is 
interesting to look at the distribution of the step sizes. Let us define 

S{n) = D{n) - D{n-l) . (6) 

The square of the variance of this quantity is given by 

M{kf = {S{nf) k -{S{n))l, (7) 

where ((.))& denotes the average over the fc-th generation. Table |3| shows nu- 
merical results for ln2 M(k) for generations 13 to 25 and also the logarithmic 
ratios at = ln2(M(k)/M(k — 1)). hi2 denotes the logarithm with respect to 
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Figure 6: Rescaled graphs of D(n), for generations 6 < k < 13. 
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Table 3: Variances M(k) and ratios a& = \n2{M{k)/M{k — 1)). 



base 2. The results for the latter quantity converge to 0.88(1). We conclude 
that 

M(k — 1) ' 1 j 

with a = 0.88(1). This exponent is consistent with the one found for the 
Hofstadter Q(n) @. 

Figure [7] shows a histogram p* of the variable x = S(n)/2°- 88( - k - 1 \ for 
k = 24 and k = 25, plotted on top of each other. The two histograms match 
nicely. The statistical distribution for k = 25 is plotted on a logarithmic 
scale in the lower part of the figure. As was the case with the distribution 
function of suitable Q-number observables, the tails can be nicely fitted with 
a properly rescaled error function erfc, defined through 

2 r°° 

erfc(x) = -= / dt exp(-t 2 ) . (9) 
\/7r Jx 



In it was observed that the probability density p m (x m ) of the rescaled 
difference x m = {Q{n) — Q(n — m))/n a was (up to a rescaling) with high 
precision identical with the distribution p* of (Q(n) — n/2)/n a , i.e., 

) = XmP*{x m /X m ) . (10) 

A similar type of scaling applies here. We define 

x m = (D(n)-D(n-m))/2 a{k - 1) . (11) 
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Figure 8: C D = \\ 2 m - 1.57| (full lines), C Q = |A^-2| (dotted lines), 
and the function exp(— m/3). 

Note that in the present case p* is the distribution of x\. One observes 
validity of eq. ( |i0|) with very good precision for m > 2. One can determine 
the A m from the second moments, 

\2 _ (^m) ~ ( X m) r-.cy\ 

They converge against A 2 ^ ~ 1.57. Looking at 

as function of m, we observe a striking similarity with the corresponding 
function for the Hofstadter sequence, see figure |[ The ups and downs in 
both cases are very similar. The decay goes like exp(— m/3). 

3.2 Numbers Left Out and Frequency Counting 

It was observed by A. K. Yao that the Q-sequence seems to leave out infinitely 
many numbers 0. 
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Table 4: Relative frequency r(M) of numbers in Ik that are generated 
by D exactly M times. The last column gives estimates for the r-ratios 
of the sequences Q, F\q, and Fu. The latter two recurrences will be 
introduced in section 5. 



The D-function maps generation k, i.e. the range [2 fc_1 + 1,2 ], to the 
interval Ik = [2 fc ~ 2 , 2 h ~ 1 ]. We consider the question which fraction r(M) 
of the 2 fc ~ 2 + 1 numbers in are generated exactly M times. It turns 
out that these fractions converge with increasing k. Table | shows r(M), 
M < 6, for k = 23 and k = 24. The D-function omits some 14 % of all 
numbers. The table also shows r(M) for the sequences Q, F w , and F n . The 
latter two sequences are close relatives of the Hofstadter sequence and will 
be introduced in section 4. There is a fair agreement of the ratios r(M) for 
all the four sequences. Figure ||] shows r(M) for M < 16 in generations 23 
and 24. Only a small deviations between the two sets of numbers is seen for 
larger M. 

Figure [10] shows a plot of the i-th left-out number in Ik, rescaled by a 
factor 2 fc_1 . The x- variable is i divided by the length of interval Ik- The 
graphs are for k = 16 and 17. The difference between the two curves is 
already small. Of course, such graphs can also be generated for M ^ 0. 
They look similar. 
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Figure 9: r(M), for k = 23 (+) and fc = 24 (x). 
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Figure 10: i-th left-out number in rescaled by a factor 2 fe_1 , for 
k = 16 (full line) and fe = 17 (dotted line). The x-variable is i divided 
by the length of interval i^. 
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4 Two Cousins of Hofstadter's Sequence 



It is natural to generalize the recurrence ([!]) by introducing constant shifts % 
and j in the arguments on the right hand side: 

Fij(n) = Fy(ni) + i^(ra 2 ) for n > 2 , 

F u (l)=F y (2) = l, 

ni = n — i — Fij(n — 1) , 

n 2 = n- j - F,ij(n - 2) . 

Of course, one has to check whether the recursion (together with given initial 
conditions) leads to a well-defined sequence for all n. Ill-definition occurs if 
there exists an n such that either n\ or n 2 is outside of [l,n — 1]. It turns 
out that definition (14) is ill-defined except for the cases ij = 00, 01, 10, and 



11, where numerical evidence for n up to several millions makes consistency 
problems rather unlikely Note that Fqq = Q. The sequence with ij = 01 
seems to have a simple regular structure, very similar to Tanny's sequence 
Pj. The other two cousins, F 10 and F n , look chaotic. A graph of the first 



2000 elements of F 00 , -F 10 , and F n is shown in figure 11. 



4.1 Statistical Properties 

We consider the sequences Fij(n) = Fij(n) — n/2. Again we study the vari- 
ances M(k), defined through 

M{kf = {F t0 {nf) k -(F %J {n))l, (15) 

where ({-))k denotes the average over intervals [2 fc - x + 1, 2*]. Table § shows 
the results for = log 2 (M(k)/M(k — 1)), where k < 25. We estimate for 
a = linifc^oo = 0.88(1) for ij = 00 (Hofstadter sequence), 0.86(1) for ij = 10, 
and 0.89(1) for ij = 11. It seems that the exponent for F 10 is smaller than 
that for the other sequences. Note however, that the given error bars could 
be underestimated. There are still fluctuations in table |5|, and we cannot 
strictly rule out the possibility that the exponents of the three sequences 
agree. 



Figure 12 shows the statistical distribution functions of the quantities 



Fij(n)/n a } where the a's are taken from the last line of table |5|. The binning 
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Figure 11: Graphs of the sequences Fqq = Q, Fiq, and F\\. 
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Table 5: Logarithmic variance ratios for Fqq, Fiq, and F\i. 



was done over periods [2 fc_1 ,2 fc ] for the 10 and 11 sequences. For Fqq the 
generation structure requires intervals [2 fc_L5 , 2 fc_0 - 5 ]. The distributions for 
the different fc's agree nicely. The plot shows the k = 24 results. The 
function with the highest peak belongs to ij = 00, the Fn-numbers have the 
broadest distribution. In contrast to the 00-distribution which (as the D- 
distribution) goes like exp(— cx 2 )/x for large x, the 10- and 11-distributions 
can be fairly well approximated by Gaussians. It is an interesting question 
whether the various behaviours can be understood and modelled. It seems 
natural to try a fit with limiting distributions of random walks. Narrow non- 
Gaussian distribution can in principle be generated by sub-diffusive random 
walks |IIJ. The observed asymptotics ~ exp(— cx 2 )/x, however, does not 



seem to be compatible with sub-diffusion. 

Again we observe scaling, if we look at the distributions of 

x m = Fij(n - m) - Fij{n) . (16) 

More precisely, the probability density of x m , m > 2 is up to a rescaling 
the same as that of x m -\. For ij = 11 one can detect some small scaling 
violations for the first 2 values of m. The approach of the A m factors to their 
asymptotic value is the same for all three F-sequences, and very similar to 



18 




Figure 12: Statistical distributions of Fij(n)/n a , for ij = 00 (highest 
peak), 10, and 11 (broadest). 



that of the .D-sequence. The convergence is again governed by a correlation 
length of 3. 

4.2 Correlation Functions 

For all three F-sequences we define a variable a n through 



<7„ 



+1 if F(n) > n/2, 
— 1 else . 



(17) 



Then we "measure" the 2-point correlator 

G(m) = (a n a n _ m ) - (a n ) 2 (18) 

over the range [2 16 , 2 24 ]. The results for |G(m)| are shown in figure |13[ The 
lower part of the figure shows |G(m)| on a logarithmic scale, together with 
the functions Cq and Cd of figure [8[ The surprise is not only that the 
correlators of the three F-sequences seem to be identical. They also have 
a striking similarity with the functions describing the decay of the rescaling 
factors A m . 
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Figure 13: G(m) for ij = 00, 10, and 11 (top). The lower plot shows 
|G(m)| for the same three sequences on a logarithmic scale (lower three 
graphs) together with the functions Cd and Cq of figure || (upper two 
graphs). 
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4.3 Frequency Counting 



Results for the relative frequencies of numbers n occuring M times in the 
F-sequences were already given in table @. They agree fairly well with those 
for D and Q. 

Summary and Conclusions 

In this paper, a chaotic cousin of Conway's sequence was introduced and 
studied. Its statistical properties showed some intriguing similarities with 
the Hofstadter sequence Q and also with the two cousins F 10 and Fq: 

• All the four sequences studied have (to the given precision) the same 
exponent a, governing the increase of variance with increasing n or k. 
(The value for F w seems to be a little bit lower, agreement can however 
not be excluded.) 

• The probability densities obey scaling. The rescaling parameter follows 
a characteristic convergence, governed by a correlation length 3. 

• The correlation function G(m) is identical for all three F-sequences. It 
also decays with correlation length 3, and in a way very similar to the 
behaviour of the A m factors. 

• The relative frequencies of numbers occurring exactly M times in the 
sequence seems to be the same for all the four sequences. 

In summary, the F-numbers and the three F-sequences have a lot of 
common structure. One might say that they share a universality class. A 
precise definition of such a class is, however, still lacking. 

The .D-sequence is unique insofar, as it has a regular generation structure 
with smooth interplays inbetween. This could make it a candidate for studies 
aiming at some rigorous results about the chaotic recurrence relations. 

It is presently an open question how much one can learn from the rela- 
tion of the F-recurrence with the "solved" a-sequence. That there is some 
deep relation is suggested by the apparent similarity of the two sequences in 
the regions between the generations. The experiments with seeding the D- 
recurrence with k generations of a-numbers (see the end of section 2) could 
be a first step towards a better understanding of this relation. 
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Whenever one observes the phenomenon of universality in a model, one 
is tempted to look for realizations of the same universality class in nature. 
It is an interesting question whether recurrences of the type studied in this 
article represent real physical processes or might be of use in the study of 
some dynamical system occuring in real life. A physical picture (e.g., in 
terms of random walks in some bizarre surrounding) could perhaps help to 
better understand some of the interesting properties of these sequences. 



Cash Prize Offered 

I offer a cash prize of $100 to the first providing a rigorous proof of the claims 
CI, C2, and C5 stated in section 2. 
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